Skip to main content
Internet Archive's 25th Anniversary Logo

Fix Broken Links Web Crawls

These crawls are part of an effort to archive pages as they are created and archive the pages that they refer to. That way, as the pages that are referenced are changed or taken from the web, a link to the version that was live when the page was written will be preserved.



rss RSS

141,125
RESULTS


Show sorted alphabetically

Show sorted alphabetically

SHOW DETAILS
up-solid down-solid
eye
Title
Date Reviewed
Creator
Wikipedia Near Real Time (from IRC)
Oct 18, 2021 Internet Archive
web

eye 138,298

favorite 0

comment 1

Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri Sep 11 00:20:52 PDT 2015 to Sat Sep 12 07:28:47 PDT 2015.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
Oct 18, 2021 Internet Archive
web

eye 485,271

favorite 0

comment 1

Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Mon Oct 6 10:54:53 PDT 2014 to Mon Oct 6 06:30:51 PDT 2014.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
Oct 18, 2021 Internet Archive
web

eye 285,139

favorite 0

comment 1

Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Jun 27 09:53:53 PDT 2015 to Sat Jun 27 05:08:56 PDT 2015.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
Oct 18, 2021 Internet Archive
web

eye 487,425

favorite 0

comment 1

Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Tue Oct 7 05:45:38 PDT 2014 to Tue Oct 7 01:43:03 PDT 2014.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
Oct 18, 2021 Internet Archive
web

eye 463,421

favorite 0

comment 1

Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Mon Oct 6 19:57:35 PDT 2014 to Mon Oct 6 15:23:09 PDT 2014.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
Oct 18, 2021 Internet Archive
web

eye 650,498

favorite 0

comment 1

Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 03:09:48 PDT 2013 to Fri Oct 11 21:36:24 PDT 2013.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
Oct 18, 2021 Internet Archive
web

eye 530,216

favorite 0

comment 1

Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 05:32:27 PDT 2013 to Sat Sep 21 00:36:03 PDT 2013.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
Oct 18, 2021 Internet Archive
web

eye 344,652

favorite 0

comment 1

Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Mon Jun 1 09:58:44 PDT 2015 to Mon Jun 1 15:12:51 PDT 2015.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
Oct 18, 2021 Internet Archive
web

eye 255,588

favorite 0

comment 1

Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Sun Feb 23 18:35:32 PST 2014 to Sun Feb 23 12:08:35 PST 2014.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
Oct 18, 2021 Internet Archive
web

eye 64,621

favorite 0

comment 1

Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl856.us.archive.org:no404 from Sun Feb 10 02:59:08 PST 2019 to Sat Feb 9 20:53:51 PST 2019.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
Oct 18, 2021 Internet Archive
web

eye 216,827

favorite 0

comment 1

Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Mon Jan 6 17:40:55 PST 2014 to Mon Jan 6 11:37:50 PST 2014.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wordpress, crawldata
GDELT
Oct 18, 2021 Internet Archive
web

eye 44,902

favorite 0

comment 1

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 17 17:22:25 PDT 2016 to Fri Jun 17 11:23:26 PDT 2016.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Wikipedia Near Real Time (from IRC)
Oct 18, 2021 Internet Archive
web

eye 1.5M

favorite 0

comment 1

Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Tue Oct 7 09:36:28 PDT 2014 to Tue Oct 7 05:34:58 PDT 2014.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC)
Oct 18, 2021 Internet Archive
web

eye 133,365

favorite 0

comment 1

Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl896.us.archive.org:no404 from Tue Jun 19 15:39:21 PDT 2018 to Wed Jun 20 13:02:46 PDT 2018.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: no404, wikipedia, crawldata
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl108.us.archive.org:wordpress from Wed Aug 11 15:55:00 PDT 2021 to Wed Aug 11 08:58:51 PDT 2021.
Topics: no404, wordpress, crawldata
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
GDELT
web

eye 12

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 13:25:12 PDT 2017 to Fri Jun 9 06:52:16 PDT 2017.
Topic: crawldata
GDELT
web

eye 5

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 04:21:33 PDT 2017 to Thu Jun 8 21:43:17 PDT 2017.
Topic: crawldata
Topics: crawl, logs
Topics: crawl, logs
Topics: crawl, logs
GDELT
web

eye 7

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 03:11:23 PDT 2017 to Thu Jun 8 20:46:20 PDT 2017.
Topic: crawldata
Topics: crawl, logs
GDELT
web

eye 36

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Mon Jun 5 22:15:49 PDT 2017 to Mon Jun 5 16:12:09 PDT 2017.
Topic: crawldata
Topics: crawl, logs
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl105.us.archive.org:no404 from Fri Nov 13 20:22:49 PST 2020 to Fri Nov 13 12:36:58 PST 2020.
Topics: no404, wordpress, crawldata
GDELT
web

eye 30

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed Jun 7 16:51:20 PDT 2017 to Wed Jun 7 10:03:25 PDT 2017.
Topic: crawldata
Topics: crawl, logs
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl897.us.archive.org:wordpress from Fri Aug 20 10:24:40 PDT 2021 to Fri Aug 20 03:45:11 PDT 2021.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl897.us.archive.org:wordpress from Sat Aug 21 01:18:33 PDT 2021 to Fri Aug 20 18:25:06 PDT 2021.
Topics: no404, wordpress, crawldata
GDELT
web

eye 29

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sat Jun 3 20:50:23 PDT 2017 to Sat Jun 3 15:06:53 PDT 2017.
Topic: crawldata
Topics: crawl, logs
GDELT
web

eye 1,046

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Tue Jun 21 23:25:05 PDT 2016 to Tue Jun 21 18:01:36 PDT 2016.
Topic: crawldata
GDELT
web

eye 40

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri May 19 01:21:12 PDT 2017 to Thu May 18 18:29:03 PDT 2017.
Topic: crawldata
GDELT
web

eye 15,552

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl853.us.archive.org:gdelt from Mon Feb 18 03:54:19 PST 2019 to Sun Feb 17 21:01:24 PST 2019.
Topic: crawldata
GDELT
web

eye 27,528

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl853.us.archive.org:gdelt from Mon Feb 18 08:18:30 PST 2019 to Mon Feb 18 01:27:32 PST 2019.
Topic: crawldata
GDELT
web

eye 6,444

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Mon Feb 18 23:26:50 PST 2019 to Mon Feb 18 15:45:41 PST 2019.
Topic: crawldata
GDELT
web

eye 5,763

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Mon Feb 18 21:27:47 PST 2019 to Mon Feb 18 15:35:18 PST 2019.
Topic: crawldata
GDELT
web

eye 7,575

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Mon Feb 18 18:52:43 PST 2019 to Mon Feb 18 13:00:57 PST 2019.
Topic: crawldata
GDELT
web

eye 4,107

favorite 0

comment 0

Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Tue Feb 19 14:25:40 PST 2019 to Tue Feb 19 08:28:30 PST 2019.
Topic: crawldata